Abstract

For a random sequence of events, with indicator variables $X_i$, the behavior of the expectation

$$E\left[\frac{X_k + \cdots + X_{k+m-1}}{X_1 + \cdots + X_n}\right] \quad \text{for } 1 \le k \le k+m-1 \le n$$

can be taken as a measure of clustering of the events. When the measure on the X's is i.i.d., or even exchangeable, a symmetry argument shows that the expectation can be no more than m/n. When the X's are constrained only to be a stationary sequence, the bound deteriorates, and depends on k as well. When m/n is small, the bound is roughly 2m/n for k near n/2 and is like (m/n) log n for k near 1 or n. The proof given is partly constructive, so these bounds are nearly achieved, even though there is room for improvement for other values of k.
1. Introduction.

In considering portions of larger, but still finite, strings of random variables, the following problem arose. If $X_1, \ldots, X_n$ is part of a stationary sequence of zeros and ones, one would not expect the ones within that portion to clump together, intuitively because each $X_i$ is as likely as any other to have the value one. Based on that intuitive argument, one could expect the expression

$$\sup_{P \in S} E_P\left\{\frac{X_k + \cdots + X_{k+m-1}}{X_1 + \cdots + X_n}\right\} \quad (\text{note: } 0/0 = 0)$$

where $1 \le k \le k+m-1 \le n$, and S is the set of stationary probability measures on binary sequences, to behave roughly like m/n. Indeed, if the probability P is restricted to be i.i.d. or even exchangeable, a simple symmetry argument yields a supremum of m/n, achieved when the $X_i$ are identically 1. For the case of stationarity, the upper bounds on the supremum for m/n small are roughly 2m/n when k is near n/2, and like (m/n) log n for k closer to 1 or n (thm. 7). The key result is a constructive proof that finds the P which achieves the supremum for the two cases of m = 1, k = 1, and m = 1, k = (n+1)/2 (thm. 2).

I  would  like  to  thank  Professor  Michael  Steele  for  insisting  that  this 
could  be  done,  and  Professor  Larry  Shepp  for  an  improvement  in  the  proof. 

I  would  also  like  to  acknowledge  the  many  simplifications  and  improvements 
suggested  by  the  referee. 


2. Results.

We shall immediately narrow our concern to the simpler problem of finding bounds for

$$R_{k,n} = \sup_{P \in S} E_P\left(X_k \Big/ \sum_{j=1}^{n} X_j\right) \quad \text{for } 1 \le k \le n. \tag{1}$$

Notice that the variables $X_{n+1}, X_{n+2}, \ldots$ do not appear in the above expression, so only the marginal distribution of $(X_1, \ldots, X_n)$ affects the values of $R_{k,n}$. A small amount of notation is needed for the next theorem, which makes use of this observation.

A loop is a finite sequence $a_1, \ldots, a_m$ of zeros and ones. Subscripts out of range will be taken circularly, so that $a_0 = a_m$, and $a_{m+1} = a_1$. For a loop a and any positive integer n, the measure $P_{a,n}$ gives mass 1/m to each of $(a_1, \ldots, a_n), (a_2, \ldots, a_{n+1}), \ldots, (a_m, \ldots, a_{m+n-1})$.
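The loop measure just defined can be tabulated directly. The following is only a minimal sketch; the function names `loop_measure` and `marginal` are illustrative and not from the paper. It builds the measure for a loop stored as a tuple and checks the shift invariance that makes it a candidate marginal of a stationary sequence.

```python
from collections import Counter
from fractions import Fraction

def loop_measure(a, n):
    """Measure on n-tuples giving mass 1/m to each of the m cyclic windows of the loop a."""
    m = len(a)
    masses = Counter()
    for start in range(m):
        window = tuple(a[(start + j) % m] for j in range(n))
        masses[window] += Fraction(1, m)
    return dict(masses)

def marginal(measure, lo, hi):
    """Marginal law of coordinates lo..hi-1 (0-based, half-open) of a measure on tuples."""
    out = Counter()
    for window, mass in measure.items():
        out[window[lo:hi]] += mass
    return dict(out)

a = (0, 0, 1, 1)           # the loop 0^2 1^2
P = loop_measure(a, 3)
print(P)
# (X_1, X_2) and (X_2, X_3) have the same law under the loop measure
print(marginal(P, 0, 2) == marginal(P, 1, 3))
```

Exact fractions are used so that equality of the two marginals can be tested without rounding.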

Theorem 1.

If a binary sequence X has a stationary distribution, then the marginal distribution of $(X_1, \ldots, X_n)$ can be written as a convex combination of measures $P_{a,n}$ for $a \in A_n$, where $A_n$ is a finite set of loops. Moreover, every $P_{a,n}$ is the marginal of some infinite stationary distribution.

More details, and a proof of this can be found in Zaman (1983) or Hobby and Ylvisaker (1964). Since expectation is a linear functional, thm. 1 allows replacing the maximization over S in eq. 1 by maximization over $P_{a,n}$ for $a \in A_n$, yielding

$$R_{k,n} = \max_{a \in A_n} E_{P_{a,n}}\left(X_k \Big/ \sum_{j=1}^{n} X_j\right). \tag{2}$$

Using the definition of $P_{a,n}$, the expectation can be further decomposed into

$$E_{P_{a,n}}\left(X_k \Big/ \sum_{j=1}^{n} X_j\right) = m^{-1} \sum_{i=1}^{m} \left(a_{i+k} \Big/ \sum_{j=1}^{n} a_{i+j}\right) \tag{3}$$

where m is the length of the loop a. In a completely unrelated problem, sums of the form given in the right side of eq. 3 have been given the name cyclic sums, e.g. Daykin (1970).
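As an illustration of the cyclic sum on the right side of eq. 3, here is a small sketch in exact arithmetic. The name `cyclic_sum` and the 0-based tuple storage of the loop are choices made for this example only; the convention 0/0 = 0 is the one stated in the introduction.

```python
from fractions import Fraction

def cyclic_sum(a, k, n):
    """(1/m) * sum over i of a_{i+k} / (a_{i+1} + ... + a_{i+n}), indices cyclic, 0/0 = 0."""
    m = len(a)
    total = Fraction(0)
    for i in range(m):
        # window holds a_{i+1}, ..., a_{i+n} (the loop is stored 0-based)
        window = [a[(i + j) % m] for j in range(n)]
        if window[k - 1] == 1:          # a_{i+k} is the k-th entry of the window
            total += Fraction(1, sum(window))
    return total / m

print(cyclic_sum((0, 0, 1, 1), 1, 3))   # loop 0^2 1^2 with n = 3, k = 1
print(cyclic_sum((0, 1), 2, 3))         # loop 0 1 with n = 3, k = 2
```

Because i runs over a full period, starting the index at 0 rather than 1 does not change the sum.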
Equations 2 and 3 convert the original probability problem of eq. 1 into a finite maximization of a function over a set of loops. This maximization is performed for chosen values of k in the appendix to prove the following key theorem.


Theorem 2.

(a) When k = 1 or n, the maximum in eq. 2 is achieved for $a = 0^{n-1}1^{\beta}$ (the notation $0^{n-1}$ refers to a block of n-1 zeros) for some number $\beta$ depending on n.

(b) When k = (n+1)/2 for odd n, the maximum in eq. 2 is achieved for $a = 0^{k-1}1$.
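For very small n the statement can be checked by exhaustive search. The sketch below is an illustration only, not the method of the appendix: it maximizes the cyclic sum of eq. 3 over all binary loops up to an arbitrarily chosen length `max_len`, and the helper names are hypothetical. A bounded search can in general only give a lower bound on the maximum; for n = 3 the loops named in the theorem are short enough to be included.

```python
from fractions import Fraction
from itertools import product

def cyclic_sum(a, k, n):
    """Right side of eq. 3 for the loop a (stored 0-based), with 0/0 = 0."""
    m = len(a)
    total = Fraction(0)
    for i in range(m):
        window = [a[(i + j) % m] for j in range(n)]
        if window[k - 1] == 1:
            total += Fraction(1, sum(window))
    return total / m

def best_loop(k, n, max_len):
    """Largest value of eq. 3 over all binary loops of length 1..max_len, with a maximizer."""
    best_value, best = Fraction(0), None
    for m in range(1, max_len + 1):
        for a in product((0, 1), repeat=m):
            value = cyclic_sum(a, k, n)
            if value > best_value:
                best_value, best = value, a
    return best_value, best

print(best_loop(1, 3, 6))   # k = 1: compare with the loop 0^2 1^2
print(best_loop(2, 3, 6))   # k = (n+1)/2 = 2: compare with the loop 0 1
```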


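Corollary 3.

Define

$$\alpha(n) = \sup_{\beta \ge 1}\, (n+\beta)^{-1} \sum_{i=1}^{\beta} 1/i.$$

Then,

$$R_{k,n} = \begin{cases} \alpha(n-1) & \text{if } k = 1 \text{ or } n \quad \text{(a)} \\ 2/(n+1) & \text{if } k = (n+1)/2 \quad \text{(b)} \end{cases} \tag{4}$$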


The  corollary  is  actually  proved  as  a  step  in  proving  thm.  2,  but  can 
also  be  proved  by  writing  out  eq.  3  for  the  loops  given  in  thm.  2. 
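The quantity defined in the corollary is easy to evaluate numerically. The sketch below assumes, as a search heuristic for this example, that the maximizing β never exceeds n + 2 when evaluating α(n); the function name `alpha_beta` is illustrative. Ties are resolved in favor of the smallest β.

```python
from fractions import Fraction

def alpha_beta(n):
    """Return (alpha(n), smallest maximizing beta), alpha(n) = max over beta of H_beta / (n + beta)."""
    best_value, best_beta = Fraction(0), 0
    harmonic = Fraction(0)
    for beta in range(1, n + 3):        # assumption: the maximizer lies in 1..n+2
        harmonic += Fraction(1, beta)   # H_beta = 1 + 1/2 + ... + 1/beta
        value = harmonic / (n + beta)
        if value > best_value:
            best_value, best_beta = value, beta
    return best_value, best_beta

# columns: n, maximizing beta for alpha(n-1), alpha(n-1)
for n in range(1, 16):
    value, beta = alpha_beta(n - 1)
    print(n, beta, f"{float(value):.6f}")
```

The printed columns can be compared line by line with Table 1 in the appendix.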

Using these equalities, a general bound for $R_{k,n}$ is easy to get. Theorems 4 and 5 do just that. The bounds of thm. 4 are depicted graphically in fig. 1.


Theorem 4.

Define

$$\sigma(k,n) = \sup_{n-k \le \beta \le n}\, (k+\beta)^{-1}\left((n-k)/\beta + \sum_{i=n-k}^{\beta-1} 1/i\right).$$

(a) $\sigma(n-k, n) \le R_{k,n} \le \alpha(n-k)$ when $2k-1 \le n$

(b) $\sigma(k-1, n) \le R_{k,n} \le \alpha(k-1)$ when $2k-1 \ge n$

(c) $1/(n+1-k) \le R_{k,n} \le 1/k$ when $2k-1 \le n$

(d) $1/k \le R_{k,n} \le 1/(n+1-k)$ when $2k-1 \ge n$.

Fig. 1: Bounds on $R_{k,n}$ as a function of k, for n = 101. The area between the upper and lower bounds of thm. 4 is shaded to indicate the possible region for $R_{k,n}$. The different bounds are labelled by the equation number in thm. 4.

Proof:

Parts (b) and (d) follow from (a) and (c) respectively, once the symmetry condition

$$R_{k,n} = R_{n-k+1,n} \tag{5}$$

is established. To prove this, note that if $P_{a,n}$ is the distribution of $(X_1, \ldots, X_n)$ then the distribution of $(X_n, \ldots, X_1)$ is given by $P_{a',n}$ for $a' = (a_m, \ldots, a_1)$. Now for any loop a,

$$E_{P_{a,n}}\left(X_k \Big/ \sum_{j=1}^{n} X_j\right) = E_{P_{a',n}}\left(X_{n-k+1} \Big/ \sum_{j=1}^{n} X_j\right)$$

from which eq. 5 follows.

The upper bound in (a) follows from Cor. 3a by

$$R_{k,n} \le \sup_{P \in S} E_P\left(X_k \Big/ \sum_{j=k}^{n} X_j\right) = R_{1,n+1-k} = \alpha(n-k).$$

Similarly, for part (c), the result of Cor. 3b shows that for $2k-1 \le n$

$$R_{k,n} \le \sup_{P \in S} E_P\left(X_k \Big/ \sum_{j=1}^{2k-1} X_j\right) = R_{k,2k-1} = 1/k.$$

The lower bounds have been included in the theorem to get some idea on the room for improvement of these bounds. It is conjectured that the actual values of $R_{k,n}$ are much closer to the lower bounds than to the upper bounds.

The lower bound (a) is obtained by using eq. 3 to get for $k \le (n+1)/2$

$$R_{k,n} \ge \sup_{a = 0^{n-k}1^{\beta},\; k \le \beta \le n} E_{P_{a,n}}\left(X_k \Big/ \sum_{j=1}^{n} X_j\right) = \sup_{k \le \beta \le n}\, (n+\beta-k)^{-1}\left[(k-1)/\beta + \sum_{i=k}^{\beta} 1/i\right] = \sigma(n-k, n).$$

The lower bound in (c) is achieved by letting $a = 0^{n-k}1$. For that value of a, if $2k-1 \le n$ then by eq. 3

$$E_{P_{a,n}}\left(X_k \Big/ \sum_{j=1}^{n} X_j\right) = \frac{1}{n+1-k}.$$

It is not difficult to find loops which give even higher lower bounds, but that does not seem to be the more fruitful direction of moving the bounds. □

Theorem 5.

$$R_{k,n} \le \frac{1 + \log(n-1)}{n} \quad \text{for } n \ge 3.$$

Before giving a proof, a logarithmic approximation for the function $\alpha$ will be established.

Lemma 6.

$$\frac{\log n - \log(\log n) - 1}{n} \le \alpha(n) \le \frac{\log n}{n}.$$

Proof:

Let $\beta^*$ be a value of $\beta$ which achieves the maximum in eq. 4, so that

$$\alpha(n) = (n+\beta^*)^{-1} \sum_{i=1}^{\beta^*} 1/i. \tag{6}$$

A crude bound to the harmonic series in eq. 6 gives

$$\alpha(n) \le (1+\log \beta^*)/(n+\beta^*). \tag{7}$$

By calculus, the function $(1+\log x)/(n+x)$ for $x \ge 1$ reaches its maximum value of $(\log x^*)/n$ when $x^* \log x^* = n$. If n > e, $\log x^*$ can be bounded by

$$\log n - \log\log n \le \log x^* \le \log n. \tag{8}$$

Plugging this information about the maximum into eq. 7

$$\alpha(n) \le (1+\log \beta^*)/(n+\beta^*) \le (\log x^*)/n \le (\log n)/n,$$

establishing the second inequality of the lemma.

For the first inequality, let $x^*$ be as before, define $\beta = [x^*]$ (the integer part), and for notational convenience let $\ell = \log n - \log\log n$ which is the term on the left side of eq. 8. Then

$$\begin{aligned}
\alpha(n) &\ge (n+\beta)^{-1} \sum_{i=1}^{\beta} 1/i \ge (n+x^*)^{-1} \log x^* \\
&\ge (n + n/\ell)^{-1}\,\ell = n^{-1}\ell^2/(1+\ell) = n^{-1}\left[\ell - 1 + (\ell+1)^{-1}\right] \\
&> (\ell-1)/n.
\end{aligned} \tag{9}$$

The last inequality substitutes a prettier expression at the cost of some precision. □
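A quick numerical check of the two inequalities, restricted to n ≥ 3 because the proof assumes n > e, might look as follows. This is only a sketch: `alpha` is a floating-point helper written for this example, and its search range for β (1 to n + 2) is an assumption of the sketch.

```python
import math

def alpha(n):
    """alpha(n) = max over beta of H_beta / (n + beta); beta searched in 1..n+2 (assumed sufficient)."""
    harmonic, best = 0.0, 0.0
    for beta in range(1, n + 3):
        harmonic += 1.0 / beta
        best = max(best, harmonic / (n + beta))
    return best

def lemma6_bounds(n):
    """Lower and upper bounds of the lemma, (log n - log log n - 1)/n and (log n)/n."""
    lower = (math.log(n) - math.log(math.log(n)) - 1.0) / n
    upper = math.log(n) / n
    return lower, upper

for n in (3, 10, 100, 1000):
    lower, upper = lemma6_bounds(n)
    print(n, f"{lower:.5f} <= {alpha(n):.5f} <= {upper:.5f}")
```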

The proof of thm. 5 then amounts to the following. By eq. 5

$$\begin{aligned}
\max_{k} R_{k,n} &= \max_{k \le (n+1)/2} R_{k,n} \\
\text{(by thm. 4a, c)} \quad &\le \max_{k \le (n+1)/2} \left\{(1/k) \wedge \alpha(n-k)\right\} \\
\text{(by lem. 6)} \quad &\le \max_{k \le (n+1)/2} \left\{(1/k) \wedge \log(n-k)/(n-k)\right\}.
\end{aligned} \tag{10}$$

Since 1/k is decreasing and the second function increasing as k increases, the maximum in eq. 10 is attained at some $k = k^*$ for which the two functions are equal. Thus

$$\max_{k} R_{k,n} \le 1/k^* = \log(n-k^*)/(n-k^*) = [1 + \log(n-k^*)]/n,$$

where the last expression follows by some algebra. Since $k^* \ge 1$, replacing it by 1 gives the claimed result in thm. 5. □
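The chain of inequalities in this proof can be followed numerically. The sketch below takes, for each k, the smaller of the two upper bounds 1/k and α(n-k) from thm. 4 (after folding k onto the left half by the symmetry of eq. 5) and compares the worst case over k with the value [1 + log(n-1)]/n that the argument above arrives at when k* is replaced by 1. The helper names are illustrative, and the search range for β inside `alpha` is an assumption of the sketch.

```python
import math

def alpha(n):
    """alpha(n) = max over beta of H_beta / (n + beta); beta searched in 1..n+2 (assumed sufficient)."""
    harmonic, best = 0.0, 0.0
    for beta in range(1, n + 3):
        harmonic += 1.0 / beta
        best = max(best, harmonic / (n + beta))
    return best

def theorem4_upper(k, n):
    """Smaller of the two upper bounds 1/k and alpha(n-k), after reducing to 2k-1 <= n."""
    k = min(k, n + 1 - k)
    return min(1.0 / k, alpha(n - k))

def worst_case(n):
    """Maximum over k of the thm. 4 upper bound."""
    return max(theorem4_upper(k, n) for k in range(1, n + 1))

n = 101
print(worst_case(n), (1.0 + math.log(n - 1)) / n)
```

For n = 101 the first number is noticeably smaller than the second, which reflects the slack introduced by replacing k* with 1.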

Returning to the original problem as stated in the introduction, one can state the following theorem based only on the definition of $R_{k,n}$.

Theorem 7.

$$\sup_{P \in S} E_P\left(\sum_{j=k}^{k+m-1} X_j \Big/ \sum_{j=1}^{n} X_j\right) \le \sum_{j=k}^{k+m-1} R_{j,n}.$$

For example, this proves that for any stationary measure P,

$$E_P\left\{\frac{X_k + \cdots + X_{k+m-1}}{X_1 + \cdots + X_n}\right\} \le (m/n)\,[1 + \log(n-1)],$$

and for blocks near the middle a bound of roughly 2m/n (the displayed expression is illegible in the source), by using the values of $R_{k,n}$ given in theorems 5 and 4c, d.
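To see how the per-position bounds add up over a block, the sketch below sums the thm. 4 upper bounds over j = k, ..., k+m-1 and reports the result as a multiple of m/n. The values n = 101 and m = 5, the helper names, and the search range for β inside `alpha` are assumptions of this example, not quantities taken from the paper.

```python
import math

def alpha(n):
    """alpha(n) = max over beta of H_beta / (n + beta); beta searched in 1..n+2 (assumed sufficient)."""
    harmonic, best = 0.0, 0.0
    for beta in range(1, n + 3):
        harmonic += 1.0 / beta
        best = max(best, harmonic / (n + beta))
    return best

def position_upper(j, n):
    """Upper bound on R_{j,n} from thm. 4, using the symmetry R_{j,n} = R_{n-j+1,n}."""
    j = min(j, n + 1 - j)
    return min(1.0 / j, alpha(n - j))

def block_upper(k, m, n):
    """Sum of the per-position upper bounds over the block j = k..k+m-1."""
    return sum(position_upper(j, n) for j in range(k, k + m))

n, m = 101, 5
print(block_upper(49, m, n) / (m / n))   # block centered at the middle
print(block_upper(1, m, n) / (m / n))    # block at the left end
```

The middle block comes out close to twice m/n, while the end block is larger, in line with the qualitative description in the abstract.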



APPENDIX

Proof of Theorem 2a.

The appendix will use eqns. 2, 3, 4, 5 and lem. 6 from the previous section. It is to be noted that these do not use thm. 2 in any way and are mainly definitional equations. To avoid repeating awkward summations, for the loop $a = a_1, \ldots, a_m$ we define

$$\begin{aligned}
S(j,k) &= \sum_{i=j}^{k} a_i \\
S_i &= S(i-n+1,\, i) \\
T_i &= a_i/S_i \\
T(j,k) &= \sum_{i=j}^{k} T_i.
\end{aligned}$$

By eq. 5, $R_{1,n} = R_{n,n}$. We will choose to work with $R_{n,n}$ for which eq. 3 can be written as

$$E_{P_{a,n}}\left(X_n \Big/ \sum_{j=1}^{n} X_j\right) = T(1,m)/m. \tag{A.1}$$

Consider the case when a is of the special form $0^{n-1}1^{x}$ for some integer x. Working out the sums involved in eq. A.1, for this a

$$E_{P_{a,n}}\left(X_n \Big/ \sum_{j=1}^{n} X_j\right) = (n-1+x)^{-1} \sum_{i=1}^{x} 1/i \le \alpha(n-1). \tag{A.2}$$

It is easy to see that in eq. A.2 equality is achieved for some value of $x \le n$ which we shall denote by $\beta(n-1)$ (the argument n-1 will be assumed from now on). The proof that amongst the set of all loops, the given loop $0^{n-1}1^{\beta}$ maximizes the expectation will be done by contradiction. Assume there is some $a = a_1, \ldots, a_m$ and $\varepsilon > 0$, for which

$$T(1,m)/m > \alpha(n-1) + \varepsilon. \tag{A.3}$$

The method of proof involves a stepwise modification of a. At each step the previous sequence will be denoted by a, and the modified one by a'. The variables m', for the length of a', as well as S' and T' will similarly be defined for a'. After each step, for the modified sequence the inequality

$$T'(1,m')/m' > \alpha(n-1) \tag{A.4}$$

will be proved. Yet after a finite number of steps, the sequence a' will essentially look like $0^{n-1}1^{\beta}$, providing the contradiction.

Step 1.

Let m' be a multiple of m, large enough so that $n/m' < \varepsilon$, for the $\varepsilon$ in eq. A.3, and also m' > 5n (this last restriction is not necessary, but allows the treatment of a loop as a long open string).

We have $a = a_1, \ldots, a_m$.

Let $a' = 0^{n-1} a_n, \ldots, a_{m'}$.

To prove eq. A.4 note that $a'_i \le a_i$, so $S'_i \le S_i$. So for i = n, ..., m' we have $T'_i \ge T_i$, and for i = 1, ..., n-1, $T_i \le 1$. Hence

$$T(1,m') \le (n-1) + T'(n,m').$$

Since m' is a multiple of m,

$$\begin{aligned}
\alpha(n-1) + \varepsilon &< T(1,m)/m \\
&= T(1,m')/m' \\
&\le ((n-1) + T'(1,m'))/m' \\
&\le \varepsilon + T'(1,m')/m'
\end{aligned}$$

which proves eq. A.4.


Step 2.

Now $a = 0^{n-1} a_n, \ldots, a_m$. Define b = S(n, 2n-1).

Let $a' = 0^{n-1} 1^{b} 0^{n-b} a_{2n}, \ldots, a_m$.

Note that a' is simply a, with the block $a_n, \ldots, a_{2n-1}$ rearranged so that all of its b ones are to the left of its zeros. We pause to prove the following lemma about switching the order of a neighboring pair of 0 and 1.

Lemma 8.

Let a and a' be two loops of the same length m, identical except that $a_{n+j} = a'_{n+j+1} = 0$ and $a'_{n+j} = a_{n+j+1} = 1$. If $a_{j+1} = 0$, then

$$T(1,m) \le T'(1,m).$$

Proof:

The proof consists simply of noting that the only differences between $T_i$ and $T'_i$ are $T_{2n+j} \le T'_{2n+j}$, $T_{n+j} = T'_{n+j+1}$ and $T_{n+j+1} = T'_{n+j}$. □
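The switching lemma can be checked exhaustively on short loops. The sketch below assumes loops of length m > n, so that a window of n entries never wraps onto itself; it uses 0-based positions, and the names `T_total` and `lemma8_holds` are introduced for this example only. Position p plays the role of index n+j in the lemma and position p-n+1 the role of index j+1.

```python
from fractions import Fraction
from itertools import product

def T_total(a, n):
    """T(1, m): sum over i of a_i / S_i, where S_i adds the n loop entries ending at position i."""
    m = len(a)
    total = Fraction(0)
    for i in range(m):
        if a[i] == 1:
            total += Fraction(1, sum(a[(i - j) % m] for j in range(n)))
    return total

def lemma8_holds(n, m):
    """Check T(1,m) <= T'(1,m) for every admissible switch of a neighboring 0,1 pair into 1,0."""
    for a in product((0, 1), repeat=m):
        for p in range(m):
            q, r = (p + 1) % m, (p - n + 1) % m
            if a[p] == 0 and a[q] == 1 and a[r] == 0:
                switched = list(a)
                switched[p], switched[q] = 1, 0   # move the one a single place to the left
                if T_total(switched, n) < T_total(a, n):
                    return False
    return True

print(lemma8_holds(3, 7), lemma8_holds(4, 6))
```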
Applying lem. 8 repeatedly over a large block yields

Corollary 9.

If a has a block of zeros $a_{j+1} = \cdots = a_{j+s} = 0$ then construct a' by rearranging the block $a_{n+j}, \ldots, a_{n+j+s}$ so that the ones are to the left of the zeros, but otherwise a and a' are identical. Then the conclusion of lem. 8 is still valid.

Returning to step 2 in the construction, $\alpha(n-1) < T(1,m)/m \le T'(1,m')/m'$, where the first inequality was established in step 1, the second follows from cor. 9.


Step 3.

Now $a = 0^{n-1} 1^{b} 0^{n-b} a_{2n}, \ldots, a_m$.

Let $a' = 0^{n-1} 1^{\beta} 0^{n-b} a_{2n}, \ldots, a_m$

so that $m' = m + \beta - b$.

By the definition of $\beta$ in eq. A.2,

$$\begin{aligned}
T(1,\, n+b-1) &= \sum_{i=1}^{b} 1/i \le (n+b-1)\,\alpha(n-1) \\
T'(1,\, n+\beta-1) &= \sum_{i=1}^{\beta} 1/i = (n+\beta-1)\,\alpha(n-1).
\end{aligned} \tag{A.5}$$

For the remaining values i = n+b, ..., m we have $T_i = T'_{i+\beta-b}$ if $\beta \ge b-1$. When $\beta < b-1$ the only difference is that $S_i > S'_{i+\beta-b}$ for $i = 2n, \ldots, 2n+b-\beta-2$, so that in all cases

$$T(n+b,\, m) \le T'(n+\beta,\, m'). \tag{A.6}$$

Combining eqns. A.5 and A.6

$$T(1,m) - T'(1,m') \le (b-\beta)\,\alpha(n-1).$$

This implies eq. A.4 as can be seen by this simple lemma.

Lemma 10.

If $T(1,m) - T'(1,m') \le (m-m')\alpha$ and $T(1,m)/m > \alpha$ then $T'(1,m')/m' > \alpha$.

Proof:

$$0 < T(1,m) - m\alpha \le T'(1,m') - m'\alpha. \quad \Box$$

Step 4.

If $b > \beta$, return to step 2; otherwise $n-b \ge n-\beta$, so the second block of zeros in a has at least $n-\beta$ elements. Let $a_c$ be the first occurrence of a 1 in $a_{2n+\beta-1}, \ldots, a_m$.

Now $a = 0^{n-1} 1^{\beta} 0^{n-\beta} a_{2n}, \ldots, a_m$.

Let $a' = 0^{n-1} 1^{\beta} 0^{n-1} a_c, \ldots, a_m$,

so that $m' = m + 2n + \beta - c - 1$.

Note that $T(1,\, 2n-1) = T'(1,\, 2n+\beta-2)$, $T'_{2n+\beta-1} = 1$ and $T(c+1,\, m) = T'(2n+\beta,\, m')$ so that

$$T(1,m) - T'(1,m') \le T(2n,\, 2n+\beta-2) + T_c - 1. \tag{A.7}$$

Let $d = S(2n,\, 2n+\beta-2)$ so that there are n-d-1 zeros in $a_n, \ldots, a_{2n+\beta-2}$. Then each $S_i$ for $i = 2n, \ldots, 2n+\beta-2$ sums at most n-d-1 zeros, and at least d+1 ones, i.e., each $S_i \ge d+1$. Since $a_i$ and hence $T_i$ is nonzero d times for $i = 2n, \ldots, 2n+\beta-2$

$$T(2n,\, 2n+\beta-2) \le d/(d+1). \tag{A.8}$$

We will separate out three cases, and in each case establish

$$T(1,m) - T'(1,m') \le (m-m')\,\alpha(n-1), \tag{A.9}$$

which would imply eq. A.4 by lemma 10.


Table 1:

  n    $\beta(n-1)$    $\alpha(n-1)$
  1         1          1.000000
  2         1           .500000
  3         2           .375000
  4         3           .305556
  5         3           .261905
  6         4           .231481
  7         4           .208333
  8         5           .190278
  9         5           .175641
 10         6           .163333
 11         6           .153125
 12         6           .144118
 13         7           .136466
 14         7           .129643
 15         8           .123539


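$$\begin{aligned}
\alpha(n-1) &\ge [n-1+(\beta-1)]^{-1} \sum_{i=1}^{\beta-1} 1/i \\
&= \left(\frac{n+\beta-1}{n+\beta-2}\right)\alpha(n-1) - (n+\beta-2)^{-1}(1/\beta)
\end{aligned}$$

which gives $\alpha(n-1) \le 1/\beta$. Since $\beta\,\alpha(n-1) \le 1$ and $(\beta-1)/\beta \le 1$,

$$\begin{aligned}
(n+1-\beta)\,\alpha(n-1) - (\beta-1)/\beta &\ge (n+1)\,\alpha(n-1) - 2 \\
\text{(by lem. 6)} \quad &\ge (n+1)\,\frac{\log(n-1) - \log\log(n-1) - 1}{n-1} - 2 \\
&\ge 0 \quad \text{for } n \ge 87.
\end{aligned}$$

The final inequality can be calculated for n = 87, and since the penultimate expression is an increasing function of n, all larger n must also satisfy it. But this establishes eq. A.10 and hence A.9 for all $n \ne 4, 6, 8$ or 10.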

Case 3: $c \ge 3n$ and n = 4, 6, 8, or 10.

This case is further broken into three subcases each involving a verification by table 1.

(3a) If c = 3n and $S_c > 1$ then $T_c \le 1/2$ so if

$$(\beta-1)/\beta - 1/2 \le (n+1-\beta)\,\alpha(n-1)$$

then eq. A.9 is satisfied.

(3b) If c = 3n and $S_c = 1$ then $S(2n+1,\, 2n+\beta-2) = 0$, and so $T(2n,\, 2n+\beta-2) = T_{2n} \le 1/\beta$. Using this in eq. A.7, we need to verify

$$1/\beta \le (n+1-\beta)\,\alpha(n-1).$$

(3c) If c > 3n then $m-m' \ge n+2-\beta$ and we need

$$(\beta-1)/\beta \le (n+2-\beta)\,\alpha(n-1).$$

As these cases are exhaustive, and in each case eq. A.4 is true, Step 4 is complete.


At every return to step 2, some elements of the original sequence are deleted or reordered into blocks of $0^{n-1}1^{\beta}$. Since no new disordered elements are created at any step, the procedure must stop after a finite number of steps. Since at each step eq. A.4 was verified, for the final a of eq. A.11 we must have

$$T(1,m)/m > \alpha(n-1)$$

yet simply computing,

$$T(1,m)/m = (n-1+\beta)^{-1} \sum_{i=1}^{\beta} 1/i = \alpha(n-1)$$

providing the contradiction which proves the theorem. □

Proof of Theorem 2b.

Let n be odd, k = (n+1)/2, and $a = a_1, \ldots, a_m$. As notation, define

$$\begin{aligned}
S(j,\ell) &= \sum_{i=j}^{\ell} a_i \\
T_i &= a_{i+k}/S(i+1,\, i+n), \\
T(j,\ell) &= \sum_{i=j}^{\ell} T_i
\end{aligned}$$

so that eq. 3 can be written as

$$E_{P_{a,n}}\left(X_k \Big/ \sum_{j=1}^{n} X_j\right) = T(1,m)/m.$$

For any loop a,

$$T(1,k) = \sum_{i=1}^{k} a_{i+k}/S(i+1,\, i+n) \le \sum_{i=1}^{k} a_{i+k}/S(k+1,\, n+1) \le 1.$$

As this holds for all loops, it will also hold for the loop $(a_{hk+1}, a_{hk+2}, \ldots, a_{hk+m})$ for any integer h. Thus

$$T(hk+1,\, (h+1)k) \le 1 \quad \text{for } h = 0, 1, 2, \ldots.$$

Adding these up for h = 0, 1, ..., m-1,

$$m \ge \sum_{h=0}^{m-1} T(hk+1,\, (h+1)k) = T(1,\, mk) = k\,T(1,m), \tag{A.12}$$

because a is periodic with period m. Rewriting A.12 gives

$$T(1,m)/m \le 1/k = 2/(n+1) \tag{A.13}$$

for any loop a. On the other hand, it is straightforward to verify that the loop $a = 0^{k-1}1$ achieves the upper bound in eq. A.13, thus proving thm. 2b and cor. 3b simultaneously. □
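The block argument can be illustrated on small cases. The sketch below uses n = 5 (so k = 3) and loops of length at most 7, both chosen arbitrarily for this example, and checks that every run of k consecutive terms $T_i$ sums to at most 1. It checks runs starting at every position, which is the same statement applied to shifted loops. The function names are illustrative.

```python
from fractions import Fraction
from itertools import product

def T(a, i, k, n):
    """T_i = a_{i+k} / S(i+1, i+n) for the loop a, with 1-based cyclic indices and 0/0 = 0."""
    m = len(a)
    if a[(i + k - 1) % m] == 0:
        return Fraction(0)
    window_sum = sum(a[(i + j - 1) % m] for j in range(1, n + 1))
    return Fraction(1, window_sum)

def blocks_at_most_one(n, max_len):
    """True if every run of k = (n+1)/2 consecutive T_i sums to at most 1, for all loops up to max_len."""
    k = (n + 1) // 2
    for m in range(1, max_len + 1):
        for a in product((0, 1), repeat=m):
            for start in range(1, m + 1):
                if sum(T(a, i, k, n) for i in range(start, start + k)) > 1:
                    return False
    return True

n, k = 5, 3
loop = (0, 0, 1)                       # the loop 0^{k-1} 1
mean = sum(T(loop, i, k, n) for i in range(1, len(loop) + 1)) / len(loop)
print(blocks_at_most_one(n, 7), mean)  # mean should equal 1/k = 2/(n+1)
```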


Bibliography 


Daykin, D.E. (1970) "Inequalities for certain cyclic sums", Proc. Edin. Math. Soc. (Ser. 2), Vol. 17, pp. 257-262.

Hobby, C. and Ylvisaker, D. (1964) "Some structure theorems for stationary probability measures on finite state sequences", Ann. Math. Stat., Vol. 35, pp. 550-556.

Zaman, A. (1983) "Stationarity on finite strings and shift register sequences", Ann. Prob., August, 1983 (To appear).



FSU  M655 

USARO  D-60 



Arif  Zaman 


Department  of  Statistics 
Florida  State  University 
Tallahassee,  FL  32306 


Controlling office name and address:
U.S. Army Research Office - Durham
P.O. Box 12211
Research Triangle Park, NC 27709


FSU  Statistics  Report  No.  M655 


USARO Grant DAAG29-82-K-0168


Distribution statement (of this report):
Distribution  unlimited. 




Clustering,  Stationary  Sequences,  Cyclic  Sums 




